S ((QUERY OR SEARCH) (5W) (SENTENCE OR CLAUSE OR STATEMENT OR INSTRUCTION OR C

S ((QUERY OR SEARCH) (5W) (SENTENCE OR CLAUSE OR STATEMENT OR INSTRUCTION OR CO

SUBJECT) (S)URL) 
Processing 

70087 QUERY 
1060660 SEARCH 
93593 SENTENCE 
67476 CLAUSE 
997976 STATEMENT 
166588 INSTRUCTION 
341414 COMMAND 
1060660 SEARCH
449524 ENGINE 

265 (QUERY OR SEARCH) ( 5W) ((( (SENTENCE OR CLAUSE) OR 

STATEMENT) OR INSTRUCTION) OR COMMAND) ( 3S) SEARCH (W) ENGINE 
28796 KEYWORD 
90243 PHRASE 
1357258 SUBJECT
47615 URL

1570 ((KEYWORD OR PHRASE) OR SUBJECT) ( S ) URL 
S4 35 ((QUERY OR SEARCH) (5W) (SENTENCE OR CLAUSE OR STATEMENT OR

INSTRUCTION OR COMMAND)) (3S) (SEARCH (W) ENGINE) AND
((KEYWORD OR PHRASE OR SUBJECT) (S) URL)
Set     Items   Description
S1      30      ((QUERY OR SEARCH) (5W) (SENTENCE OR CLAUSE OR STATEMENT OR
                INSTRUCTION OR COMMAND)) (3S) (SEARCH (W) ENGINE) (S) URL
S2      11      S1 NOT PY > 1998
S3      10      RD (unique items)
S4      35      ((QUERY OR SEARCH) (5W) (SENTENCE OR CLAUSE OR STATEMENT OR
                INSTRUCTION OR COMMAND)) (3S) (SEARCH (W) ENGINE) AND
                ((KEYWORD OR PHRASE OR SUBJECT) (S) URL)
S5      8       S4 NOT PY > 1998
S6      7       RD (unique items)
S7      138     ((TRANSFER OR TRANSMISSION OR TRANSMIT OR DOWNLOAD OR RENDER)
                (W) (SPEED OR TIME OR RATE)) AND ((REQUEST OR REQUESTED) (S)
                BASED (S) (WEB (W) PAGE OR URL) (S) SIZE)
S8      0       S7 NOT PY > 1997
S9      336     ((TRANSFER OR TRANSMISSION OR TRANSMIT OR DOWNLOAD OR RENDER)
                (W) (SPEED OR TIME OR RATE)) AND ((REQUEST OR REQUESTED) (S)
                BASED (S) (WEB (W) PAGE OR URL OR OBJECT OR DOCUMENT) (S) SIZE)
                AND CLIENT (S) SERVER
S10     130     S9 NOT PY > 1998
S11     130     RD (unique items)
>>>SET HILIGHT: use ON, OFF, or 1-5 characters

KWIC is set to 50.

112536
17069
43810
123906
5112
62197
98830
388^415
r6.70
338.308
2897
231-055
108392
3949
1469
S1 2

SEARCH
QUERY
FORMULATION
CLAUSE
INSTRUCTION
COMMAND
ACTION
RETURN
RETURNING
URL
URLS
RETURNING) (5W) (URL OR URLS)
INTERNATIONAL PATENT CLASS: G06F-017/30
ABSTRACT WORD COUNT: 144 
FULLTEXT AVAILABILITY:

Available Text Language Update Word Count 

CLAIMS A (English) 9806 569 

SPEC A (English) 9806 5027 
Total word count - document A 5596 
Total word count - document B 0
Total word count - documents A + B 5596 

...SPECIFICATION the secondary storage. Typically the memory areas share 
the hard disk 110 with the database 112. The areas are for:- a category 
list 200, a keyword list 205, a URL store 210, a document store 215, 
a report store 220, a search engine list 225, search engine syntaxes 230, 
search result files 235 and final URLs 240. Of these areas, the data in 
the category list 200, the keyword list 205, the search engine list 225 
and the search engine syntaxes 230 is defined before the operation of the 
present embodiment. The remaining areas... 

...the search engines, retrieving the results and processing the results is 
described in more detail below with reference to Figure 5 and Figure 6. 

The URL lists produced by the search engines are then combined in
step 312 to form a single list of URLs. This step includes removing
duplicate URLs where more than one search engine has raised the same URL
for the same word. Obviously, this step need only be carried out if more
than one search engine is used. The result of this is effectively a list
of URLs for each keyword.
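
The combining step described in this excerpt can be sketched roughly as follows. This is only an illustrative sketch, not the patent's implementation: the function name merge_url_lists, the shape of the input dictionary, and the example.com-style URLs in the usage note are all assumptions made for the example.

```python
def merge_url_lists(results_by_engine):
    """Combine per-engine results into one de-duplicated URL list per keyword.

    results_by_engine is a hypothetical structure mapping an engine name to
    a dict of {keyword: [url, ...]} as returned by that engine.
    """
    merged = {}
    for engine_results in results_by_engine.values():
        for keyword, urls in engine_results.items():
            # A dict preserves insertion order, so the first engine to
            # raise a URL for a keyword fixes its position in the list.
            seen = merged.setdefault(keyword, {})
            for url in urls:
                seen.setdefault(url, None)  # repeats from other engines are ignored
    return {keyword: list(urls) for keyword, urls in merged.items()}
```

Called with two engines that both return http://b.example/2 for the keyword "slide", the function keeps that URL once, in the position where it was first seen. With a single engine the merge is a no-op apart from removing repeats within that engine's own list.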

Then, in step 316, the list of URLs (which is arranged by keyword) is
cross-referenced back with the original description of categories and
keywords, generated in step 302, to identify those URLs which are
candidates for each category. The URLs for each keyword in each
category are then filtered, in step 320, to remove pages which are not to
be processed. Pages which are commonly removed at this point include 
non-http references and other non-promising sites such as foreign 
PRIORITY: 7-004597, JP (Japan), January 17, 1995 (19950117)
FULL TEXT: 1624 lines 



...draft-ietf-http-v10-spec-04.html, IETF HTTP Working Group, October 1995.

[2] T. Berners-Lee, L. Masinter, and M. McCahill: "Uniform Resource
Locators (URL)", Request For Comments rfc1738.txt, anonymous ftp from
ds.internic.net/rfc, December 1994.

[3] "Frequently Asked Questions About Lycos", URL
http://lycos.cs.cmu.edu/reference/faq.html, Lycos Inc., 1995.

[4] "Netscape Navigator", URL http://www.mcom.com/, Netscape, 1995.

[5] C. Weider, P. Faltstrom, R. Schoultz: "How to interact with a
Whois++ mesh", IETF Internet Draft draft-ietf ... found, all of the resources
in the same group can be retrieved), and to improve search quality
(resources in the group might not match the keyword description but might
still be relevant).

Historically, each resource is grouped into a single, or at best, small 
number of groups. Doing so reduces the... 

... Note that, taken by itself, there is nothing novel about the
functionality of a single link server. It is similar to the functionality
of any search engine. It is the total combined functionality of all
link servers spread over multiple networked computers that creates a
working Ingrid topology.

3.1.2 Embodiment... or identical to any other resource in the resources
list.

The operation according to the flow chart shown in FIG. 7 is executed by 
the search client, as follows. 

First, a command to search for resources with a term combination TCg is 
received (step 701), and the resources list and the queried list 
...ABSTRACT: Successful searching of the Internet depends on techniques
such as going straight to the information source, guessing URLs, and
developing strategies for when to use subject directories and search
engines. If the information needed is of the type likely to be made
available on the Web, then it is just a matter of determining who would
make it available. In guessing the URL, it is important to remember that
both Netscape Navigator and Microsoft's Internet Explorer automatically
take a host address and add the common http:/ at...

...TEXT: organization for specific information is a starting point, but
then the searcher needs to locate the organization's Internet presence,
i.e., to find the URL. The subject directories and larger search
engines can certainly be used for this task, but before spending time with
those, try some simple URL guesswork. For many Web sites, the unofficial
standard of the www.company.com address may easily take a searcher directly
to the top-level Web...pages buried inside a Web site. The search engines
can be used for tracking down top-level pages for organizations when
neither guessing nor the subject directories help, but they require a
different approach. Search for the organization's name as a phrase, but
expect to see many subsidiary pages from the organization's Web site or
links to it from sites other than the top-level page. That top-level page
may be in the retrieval set, but it is often ranked low. However, simply
take the subsidiary pages' URL to determine the root URL for the main
site.
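
The last tip, working back from a subsidiary page to the site's top-level page, amounts to keeping the scheme and host of the URL and discarding the rest. A minimal sketch using Python's standard urllib.parse module follows; the function name root_url and the sample path in the usage note are assumptions for illustration, not part of the article.

```python
from urllib.parse import urlsplit, urlunsplit


def root_url(page_url):
    """Return the top-level URL (scheme and host only) for a subsidiary page."""
    parts = urlsplit(page_url)
    # Keep scheme and host, reset the path to "/", drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/", "", ""))
```

For a hit such as http://www.company.com/products/recalls/notice.html, this yields http://www.company.com/. It assumes the organization's main page lives at the root of the same host, which the article itself presents only as a common convention.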

Unique keywords. Unique keywords, especially ones that uniquely identify a
topic, are much better suited to large search engine queries. While
even unique keywords may bring up many hits, certain kinds of keywords can
very effectively narrow the search. Drug names, unique product names...
search results in the large Web databases. AltaVista, Infoseek, and
OpenText all support field searching. The most useful of the fields
available are title and URL. For example, if you or a patron remember
seeing a recall notice about a product from Defective Company, structure a
search that searches for pages with the keyword recall and the URL
www.defective.com. In the AltaVista simple search and Infoseek, the
search statement would read +recall +url:www.defective.com. An easy
way to narrow a search is to require the most important word to appear in
the title.

Limits. A similar... 
Zorn, Peggy; Marshall, Lucy; Panek, Mary
Online v21n3 PP: 30-51 May/Jun 1997 
WORD COUNT: 8051

...TEXT: as supported by volunteers. One part of this software group is a 
searching facility that can be applied to both Internet and intranet use. 

The search engine software is called Glimpse; it is available and
supported in the forms of Glimpse, GlimpseHTTP, and WebGlimpse. In order to
use Glimpse on a web...

...once, which GlimpseHTTP cannot do. This section of the evaluations will
focus on WebGlimpse, since it is the most appealing to those considering an
intranet search engine.

When installed, WebGlimpse inserts a search box at the bottom of every HTML
page specified. The search box can be set to search the entire...
...someone trying to find Web pages containing the phrase "Arizona Desert"
and the word "Windsurfing" would have to type in Arizona desert; windsurfing
as a search command.

The advanced searching page also includes options for case-sensitive 
...SPECIFICATION way in which search engines are used to generate URL lists
is now described. Search engines are commonly used in association with
the WWW. A search engine typically carries out a search in response
to an http command. Essentially, such a command comprises a URL for
the search engine and a fixed syntax specifying the search to be
carried out by the search engine at that URL. Typically, the syntax
for the same command for different search engines varies and needs to be
determined before successful searching in this manner can be achieved on
different search engines. The following http command example tells
the search engine "AltaVista" to search for the keyword "slide":

http://altavista.digital.com/cgi-bin/query?pg=q&what=web&fmt=.&q=slide.
Obviously, it would be...
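
One way to picture the "URL plus fixed syntax" idea in this excerpt is a table of per-engine templates into which the keyword is substituted. The sketch below is an assumption-laden illustration, not the patent's method: the ENGINE_SYNTAXES table, the "{keyword}" placeholder convention, and the function name build_search_command are invented here, and only the AltaVista query string comes from the excerpt.

```python
from urllib.parse import quote_plus

# Hypothetical syntax table: one template per engine, with "{keyword}"
# marking where the search term is substituted. Only the AltaVista entry
# is taken from the excerpt; other engines would need their own syntax.
ENGINE_SYNTAXES = {
    "AltaVista": (
        "http://altavista.digital.com/cgi-bin/query"
        "?pg=q&what=web&fmt=.&q={keyword}"
    ),
}


def build_search_command(engine, keyword):
    """Return the http command that asks `engine` to search for `keyword`."""
    template = ENGINE_SYNTAXES[engine]
    # quote_plus escapes spaces and reserved characters for a query string.
    return template.format(keyword=quote_plus(keyword))
```

With this table, build_search_command("AltaVista", "slide") reproduces the command quoted in the excerpt (without the sentence-ending period). Because each engine's syntax differs, adding an engine means adding a template rather than changing the function.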
Fulltext Availability:
Detailed Description 



Detailed Description 

... link with the ISP; 2. A "browser" application program, e.g.,
Netscape, is executed on the user's PC; 3. A connection to an Internet
"search engine" website, e.g., Yahoo, WebCrawler, Infoseek,
and the like, is executed by the user through the browser, thereby
establishing a connection between the user's PC and the search engine
website; 4. Once the search engine website connection is made, a
search engine input screen is then displayed on the user's PC monitor
while the search engine is idle in an "input wait mode"; 5. A search
strategy in the form of key word search terms is provided by the user as
textual input to the search engine on the search engine's input
screen through the PC's keyboard; 6. In turn, the search engine awaits
a "finished" input command from the user in the form of a mouse click of
an icon control button or alternatively a stroke of the "enter" key on
the keyboard; 7. The user's search strategy request is then delivered
onto the world wide web through the search engine; 8. A list of
Internet websites (search hits or citations) and corresponding URLs
(Universal Resource Locators) or website addresses are received and
displayed on the user's monitor; and 9. The user may then select a
particular URL of interest by a "mouse click" which causes information
associated with the URL to be subsequently retrieved and displayed for
digestion by the user, and for further inquiries.

Alternatively to the foregoing scenario, if the URL or "website... 
...TEXT: The list includes well-known tools such as Alta Vista, HotBot,
Lycos, and InfoSeek, along with less well-known Web indexes such as RBSE's
URL Database, Mesch, and TVhoy! Most of the search forms are similar,
featuring a search button and a search statement box. Some search
engines also have options to select, such as the number of records to
display or the kind of search to run.

Another...
(USE FORMAT 7 OR 9 FOR FULLTEXT)

noticed the funny looking set of letters at the bottom of many
print and TV ads and programs? WWW something or other? This is a URL, a
unique combination of keyboard characters that identifies a Web site among
the millions of others. Knowing the URL of a particular site is all you
need to get there on your PC. If you don't know the URL, enter a key word
into a Web search engine, which will produce a list of relevant URLs
for you automatically. Click on the one you're looking for, and you're
there!

Something for...
S1      2       ((SEARCH OR QUERY) (5W) (STATEMENT OR FORMULATION OR SENTENCE
                OR CLAUSE OR INSTRUCTION OR COMMAND) (S) (ACTION OR VERB OR
                SUBJECT OR KEYWORD)) (S) (RETURN OR RETURNING) (5W) (URL OR URLS)
S2      709     (TRANSFER OR TRANSMISSION OR DOWNLOAD OR DOWNLOADING OR RENDER
                OR RENDERING OR PROPAGATION OR RESPONSE) (W) (TIME OR RATE
                OR SPEED) AND (BASED (S) (URL OR PAGE OR DOCUMENT OR OBJECT)
                (3W) (SIZE OR CONTENT))
S3      312     S2 NOT PY > 1997
S4      164     (TRANSFER OR TRANSMISSION OR DOWNLOAD OR DOWNLOADING OR RENDER
                OR RENDERING OR PROPAGATION OR RESPONSE) (W) (TIME OR RATE
                OR SPEED) AND (BASED (S) (URL OR PAGE OR DOCUMENT OR OBJECT)
                (3W) (SIZE OR CONTENT)) AND CLIENT (S) SERVER
as attachments... If they must be manually processed by the user, the user
is forced to periodically check for receipt of the acknowledgment
message, then take appropriate action if it has not been received.
Automatic acknowledgment processing shifts this burden entirely to the
provider and consumer programs 12, 22. The user can simply... the
notification element 143, and because of the structured format of the
message data contained in the communications object. Notification methods
141 may trigger any action available to the consumer program 22,
subject to the user's permissions.

FIG. 4 illustrates two typical notification methods assigned to an
element preference instance 221. A SendEMail method 224 causes...
provider or drawn from the consumer database 21 or provider database 11.
Any communications object component stored in either of these databases
may be included, subject to the consumer's or provider's data access
rules discussed below. The only difference between input forms produced
by data exchange methods and standard... server maintained by the
provider. These queries are easily set up because they can be composed
using any data available in the consumer database 21 (subject to the
consumer's data access rules, as explained above), so the consumer need
only enter any new data required. The queries are easily automated...
example, they can be sorted by their parent communications object (110,
FIG. 3), by related notification element (201, FIG. 4), by the type of
action required by the provider, or by any other element contained in
the message object which produced them. Within the notification report,
message elements can... associated with an HTML page, automatically. A
link element 143, on the other hand, can designate any number of
resources to be retrieved in one action. It can supply these resources
through its own attributes.
... within the database storage 32, and when it finds one having a string
portion equal to the received simplified network address <patent-search> it
will return the corresponding URL portion <patents.uspto.gov>. The
database storage is searched using a search engine 34 which accepts the
received simplified network address 36 and uses the... a request to the
HatchUSA server which is a specific example of a server implementing a
conversion database 258. The database 258 is asked to return a URL 266
which corresponds to the simplified network address <888>. The HatchUSA
server 258 returns a fully formatted network access command
</www.hatch.co.jp/> as... search?p=hatch>. This URL causes the browser to
contact the search engine at <search.yahoo.co.jp> and to initiate a search
for the keyword <hatch>. The search engine will return the results of the
search back to the browser. With... FIG. 13, the user inputs a simplified
network address <yahoo://olympic> and the YAHOO! (r) Internet search
database returns the results of search on the keyword <olympic>. Since
YAHOO! (r) does not recognize the string <yahoo://olympic>, the message
aliasing process is used to construct a network access command that is...
alias message string. The alias message string is properly formatted to
command the YAHOO! (r) database to return the results of a search on the
keyword <olympic>. The properly formatted network access command is sent
to the YAHOO! (r) database 284 via the network connection 282. The database
server 284 returns...

...completing the simplified server commanding process.

There are an increasing number of network servers capable of responding
to predefined commands such as the <http://URL/.../search?p=parameters/>
command defined for the YAHOO! (r) system. Presently only
the <http://URL/.../>, <mailto://email address/>, <ftp://.../> and
<news://newsgroup address/> are defined. As new server commands...
Detailed Description
... the object list.

The search form 620, as illustrated in FIG. 14 will be used as an example
for processing of a form request. The search form 620 presents the
user with a screen which allows... to include a message about the new
object in the user's notification report (including its size, methods,
update intervals, etc.), or any other notification action or
combination of actions. Notification preferences and methods are further
described below. Also, different actions may be taken based upon the
program state and operation... the original communications object 110.
Alternatively, method names can be polymorphic. In this case a method
included in the communications object 110 could perform one action when
received by the consumer program 22, but another action when called by
a message object in the provider program 12. The method can distinguish
between these programs by matching the system ID of its... mail options
can invoke a SendEMail method 143 which can obtain from the element 143
all data necessary to address the e-mail, specify the subject line or
subject line subsections, add other carbon copy or blind carbon copy
recipients, and include any additional data in the body of the message or